PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa00578s040.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family GRAS
Protein Properties Length: 597aa    MW: 66877.5 Da    PI: 4.8415
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa00578s040.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS443.21.9e-1352275963374
            GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlt 97 
                     + L++cA+a+s+g+ e+a +++++l++++s +gdp qR+aay++e+Laar+a s++ +y+al+++e +   s+e+laa+++++ev+P++kf++l+
  Csa00578s040.1 227 QILISCARALSEGKSEEALSMVNELRQIVSIQGDPSQRIAAYMVEGLAARMAASGKFIYRALKCKEPP---SDERLAAMQVLFEVCPCFKFGFLA 318
                     78*****************************************************************9...9*********************** PP

            GRAS  98 aNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledle 190
                     aN aI ea++gee+vHiiDfdi+qG Q+++L++++a+ p++ p+lR+Tg+++pes+  s   l+ +g rL+++A+  gv+f+f++ v ++++ ++
  Csa00578s040.1 319 ANGAIIEAIKGEEAVHIIDFDINQGNQYMTLIRSIAELPGKRPRLRLTGIDDPESVqrSIGGLSIIGLRLEQLAKDHGVSFKFKA-VPSKTSIVS 412
                     ******************************************************9988899************************.7******** PP

            GRAS 191 leeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikv 285
                     +++L +kpgE+l+Vn+++qlh+++desv++ ++rde+L++vksl+Pk+v+vveq++++n+++F+ rf+ea eyysa+fdsl+++lpres+er++v
  Csa00578s040.1 413 PSTLGCKPGETLIVNFAFQLHHMPDESVTTVNQRDELLHMVKSLNPKLVTVVEQDVNTNTSPFFSRFVEAYEYYSAVFDSLDMTLPRESQERMNV 507
                     *********************************************************************************************** PP

            GRAS 286 ErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                     Er++l+r+ivn+vaceg+er+er e ++kWr+r+++aGF+p p+s++++++++ l+++ + + y+++ee g+l ++W++++L+++SaWr
  Csa00578s040.1 508 ERQCLARDIVNIVACEGEERIERYEAAGKWRARMMMAGFSPKPMSSRVTNNIQNLIKQQYCNNYKLKEEMGELHFCWEEKSLIVASAWR 596
                     ************************************************************888*************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098564.767199576IPR005202Transcription factor GRAS
PfamPF035146.4E-133227596IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 597 aa     Download sequence    Send to blast
MVEQTVVREH IKARIMSLVR SAEPSSYRNP KLYSLNENVN NIGGVTSAQI FDQDRSKNPC  60
LTDDSYPSQS YEKYFLDSPT DEFVQQHPIG SGASVSSFGS LDSFPYQSRP VLGCSMEFQL  120
PFDSTTSTSS TRPLGGYQAV SYSPSMDVVE EFDDEQMRSK IQELERALLG DEDDKMVGVD  180
NLMEIDNEWS YQNESEQHQD SPKESSSADS NSHVSSKEVV SQTTPKQILI SCARALSEGK  240
SEEALSMVNE LRQIVSIQGD PSQRIAAYMV EGLAARMAAS GKFIYRALKC KEPPSDERLA  300
AMQVLFEVCP CFKFGFLAAN GAIIEAIKGE EAVHIIDFDI NQGNQYMTLI RSIAELPGKR  360
PRLRLTGIDD PESVQRSIGG LSIIGLRLEQ LAKDHGVSFK FKAVPSKTSI VSPSTLGCKP  420
GETLIVNFAF QLHHMPDESV TTVNQRDELL HMVKSLNPKL VTVVEQDVNT NTSPFFSRFV  480
EAYEYYSAVF DSLDMTLPRE SQERMNVERQ CLARDIVNIV ACEGEERIER YEAAGKWRAR  540
MMMAGFSPKP MSSRVTNNIQ NLIKQQYCNN YKLKEEMGEL HFCWEEKSLI VASAWR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A5e-572275966375GRAS family transcription factor containing p
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0154470.0AC015447.8 Arabidopsis thaliana chromosome I BAC F24J8 genomic sequence, complete sequence.
GenBankAY0458330.0AY045833.1 Arabidopsis thaliana putative scarecrow 1 protein (At1g21450) mRNA, complete cds.
GenBankAY0966260.0AY096626.1 Arabidopsis thaliana unknown protein (At1g21450) mRNA, complete cds.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010496506.10.0PREDICTED: scarecrow-like protein 1
RefseqXP_010496505.10.0PREDICTED: scarecrow-like protein 1
SwissprotQ9SDQ30.0SCL1_ARATH; Scarecrow-like protein 1
TrEMBLD7KK120.0D7KK12_ARALL; Putative uncharacterized protein
STRINGfgenesh2_kg.1__2335__AT1G21450.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM69532744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21450.10.0SCARECROW-like 1